Draft version January 31, 2013 

Preprint typeset using I^T^X style cmulatcapj v. 12/16/11 



WHAT ARE THE PROGENITORS OF COMPACT, MASSIVE, QUIESCENT GALAXIES AT Z = 2.3? 
THE POPULATION OF MASSIVE GALAXIES AT Z > 3 FROM NMBS AND CANDELS 

Mauro Stefanon 1,2 , Danilo Marchesini 3 , Gregory H. Rudnick 1 , Gabriel B. Brammer 4 & 

Katherine E. Whitaker 5 

Draft version January 31, 2013 

ABSTRACT 

Using public data from the NEWFIRM Medium-Band Survey (NMBS) and the Cosmic Assembly 
Near-Infrared Deep Extragalactic Legacy Survey (CANDELS), we investigate the population of mas- 
sive galaxies at z > 3. The main aim of this work is to identify the potential progenitors of z ~ 2 
compact, massive, quiescent galaxies, furthering our understanding of the onset and evolution of mas- 
sive galaxies. Our work is enabled by high-resolution images from CANDELS data and accurate 
photometric redshifts, stellar masses, and star formation rates (SFRs) from 37-band NMBS photome- 
try. The total number of massive galaxies at z > 3 is consistent with the number of massive quiescent 
galaxies at z ~ 2, implying that the SFRs for all of these galaxies must be much lower by z ~ 2. We 
discover four compact, massive, quiescent galaxies at z > 3, pushing back the time for which such 
galaxies have been observed. However, the volume density for these galaxies is significantly less than 
that of galaxies at z < 2 with similar masses, SFRs, and sizes, implying that additional compact, 
massive, quiescent galaxies must be created in the intervening ~ 1 Gyr between z = 3 and z = 2. 
We find five star- forming galaxies at z ~ 3 that are compact (R e < 1.4 kpc) and have stellar mass 
M» > 10 10 6 M©; these galaxies are likely to become members of the massive, quiescent, compact 
galaxy population at z ~ 2. We evolve the stellar masses and SFRs of each individual z > 3 galaxy 
adopting five different star formation histories (SFHs) and studying the resulting population of mas- 
sive galaxies at z — 2.3. We find that declining or truncated SFHs are necessary to match the observed 
number density of massive, quiescent galaxies at z ~ 2, whereas a constant SFH would result in a 
number density significantly smaller than observed. All of our assumed SFHs imply number densities 
of compact, massive, and quiescent galaxies at z ~ 2 that are consistent with the observed number 
density. Better agreement with the observed number density of compact, massive, quiescent galaxies 
at z ~ 2 is obtained if merging is included in the analysis and better still if star formation quenching 
is assumed to shortly follow the merging event, as implied by recent models of formation of massive, 
quiescent galaxies. 

Keywords: galaxies: high-redshift, galaxies: compact, galaxies: evolution, galaxies: fundamental pa- 
rameters 



1. INTRODUCTION 

The population of galaxies in the Local Uni- 
verse presents a clear bi-modal ity, as evidenced 
by color-magnitude d iagrams (e.g iBaldrv et all [200l 
Kauffmann ct al. 2004) , with galaxies either living on the 
red sequence or in the blue cloud. This bi-modality is fur- 
ther supported by tight correlations bet ween the main 
physical properties of each class (e .g. iTullv fc Fisherl 
119771: iKormendv fe Djorgovskil I1989T ). Galaxies on the 
red sequence typically are massive and quiescent (i.e. 
with low or no ongoing star formation) with early-type 
morphologies, while galaxies in the blue cloud are less 
massive, with higher star formation rates (SFRs) and 
have spiral or irregular morphologies. 
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The physical mechanisms driving the onset of the ob- 
served bi-modality are one of the main open issues in the 
study of galaxy formation. At high redshift the obser- 
vational picture is complicated by the lack of a common 
definition for massive, quies cent galaxies (MQG s) in the 
literature (see discussion in lSaracco et al.ll2012l ). 

Samples selected according to the spectral energy dis- 
tribution (SED) show that massive (i.e. stellar mass 
M* > IO^Mq) and quiescent (i.e. specific star forma- 
tion rate sSFR < 10 -11 — 10~ 10 yr~ 1 ) galaxies were al- 



eady in place at z ~ 2 dFranx et al. 120031 : iDaddi et al.l 



ready 

[200alKriek et~all[200l ICimatti et al.ll200"i . Their num 
ber density has grown by almost a factor of 10 since z = 3 
(iLabbe etall 120051: lArnouts et al.l 120071; iFontana et al 
2009[lTavlor et al.ll2009[ lllbert et al.ll2010HCassata et al 
20111 : iBrammer et al.l 120111 : iDommguez Sanchez et al 



20 lit iBell et all 120111 ) although most of t he evolution 



occurred at z > 1 (e.g. iPozzetti et al.l I2007T) . 

In addition to rapid number density evolution, the sizes 
of MQGs have evolved dramatically from high redshift to 
the present day. Most of MQGs at z > 1.5 are also com- 
pact systems: the observed effective radii are generally 
a factor of ~ 3 — 5 smaller at z ~ 2. 5 (IDaddi et al.ll2005b 
lLonghetti et~aTl 120071: iToft et al.l 120071 iTruiillo et al.l 
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20071 Ivan Dokkum et al l 12008. iCimatti et all 
20081 ISaracco et all 1200a Ivan der Wei et al.l 120081: 



Bezanson et al. I 120091: iSzomoru et al.l 12012ft than at 



z = 0, where such compac t galaxies are almo st entirely 
absent (|Truiillo et al.ll200l iTavlor et al.ll2010h . 

There have been multiple attempts to explain the evo- 
lution of these objects and to place them into a broader 
cosmological context. In the so-called monolithic col- 
lapse scenario, MQGs would have assembled almost all 
of their stellar mass at high redshift, followed by a passive 
evolution of the stellar population. This class of models, 
however, foresees little or no evolution in size, in contrast 
to observations. More recent models describe the forma- 
tion of MQGs as a two-stage process: gas-rich merger 
events generate compact, massive spheroids at z > 3, 
while minor, dry (i.e. without formation of new stars) 
mergers at later cosmic times would increase the size, 
while responsible for only a small increase in s tellar mass 
(INaab et al.l [20071 1200a IWuvts et al l f2010t IQser et al.l 
120121 ). Indeed. iBezanson et al.l (|200l showed that the 
stellar mass density of the inner ~ lkpc of z ~ 2 galax- 
ies does not substantially differ from that in prcscnt-day 
MQGs, suggesting that the growth during the second 
phase progresses from the inner region rapidly towards 
the external parts (the so-ca lled inside-out growth - see 
e.g. Ivan Dokkum et alj|20ldh . 

However, the picture is still far from being completely 
clear: detailed comparisons with ACDM models of dry 
merging show that some of the models predict descen- 
dants of z > 2 too compact com pared to the observed 
local MQGs (|Cimatti et al.|[2~012D . Recent studies have 
also revealed the existence of a large number of MQGs 
at z ~ 1.5 with sizes similar to those of the local galax- 
ies with comparable mass (|Saracco et al.l [200l l20lH : 
iMancini et~aLl 120101 ) . If the population of MQGs at 
z > 2.5 includes only compact systems, this would imply 
a size evolution timescale of ~ 1 Gyr, chall enging the cur- 
rent models of formation of local MQGs (|Saracco et al.l 
120101) . 

The main goal of this work is to identify, within an 
observational framework, the population of galaxies at 
z > 3 which could give rise to the population of compact, 
MQGs observed at z = 2.3. We combine data from two 
public s urveys, the NEWFIR M Medium-Band Survey 
(NMBS. IWhitaker et ai1l2011ft and the Cosmic Assembly 
Near-i nfrared Deep Extragalactic Legacy Survey ( CAN- 
DELS. iGrogin et al.l[201iriKoekemoer et al.ll2011l ). The 
NMBS provides accurate measurements of photometric 
rcdshifts (z p hot), stellar masses (M*) and star-formation 
rates (SFR), while robust morphologies and sizes were 
measured from the high-resolution imaging from CAN- 
DELS. 

In Section[2j we describe the data sets used in our work, 
the measurements of the sizes, and the completeness in 
stellar mass. The results are presented in Section [3J We 
discuss our results in Section 01 and summarize them in 
Section 5. 

Throughout this work, we use a concordance cosmol- 



ogy with Hq = 70 Km s Mpc 



Sir 



0.3 and 



£!a = 0.7. All magnitudes are referred to the AB sys- 
tem. 

2. DATA 



We combined the photometric depth and the ac- 
curate measur ements of photometric r edshifts offered 
by th e NMBS (|van Dokkum et "ail 1200a I Whitaker et all 
l20ll with the high-resolution HS T/WFC3 deep NIR 
imaging from the C ANDELS (|Grogin et al.l I20TTI: 
IKoekemoer et al.l2011l ) . These two surveys will be briefly 
described in the next two sections. 

2.1. NMBS-COSMOS 

The NMBS covers two fields: a 27.6" x 27.6 " pointing 
within the COSMOS field (|Scoville et al.l[2007h and a sec- 
ond pointing, of the same size, overlapping with part of 
the All- Wavelength E xtended Groth Str ip International 
Survey (AEGIS) strip (jDavis et al.H2007ft . Only the data 
from the COSMOS field was used, since the overlap be- 
tween NMBS- AEGIS and CANDELS is minimal; using 
both fields the average improvement in cosmic variance 
would have been just 5%. 

NMBS uses five medium-bandwidth filters in the NIR 
wavelength range 1 — 1.8 /im. The bluest filter is similar 
to the Y band, the canonical J and H bands are split 
into two filters each. Such configuration pinpoints the 
location of the re dshifted Balmer/4000 Abr eaks in 1.5 < 
z < 3.5 galaxies (van Dokkum ct al. 2009) and enables 
photometric redshift measurements with an accuracy of 
02/(1 + z) ~ 2% for objects i n the redshift range 1.5 < 
z < 3.5 (IWhitaker et al.ll20rl . 

The full public c atalog for the COSMOS field 
(|Whitaker et al.ll20llFI provides UV-to-24 fim photom- 
etry for 31306 K-selected sources based on the de- 
blended version, along with accurate photometric red- 
shifts, rest-frame luminosities, star formation rates, and 
stellar mass measurements. The redshifts and stellar 
masses were computed using 37 filters from the COS- 
MOS fields, combining the NIR medium-bandwidth data 
with existing UV (Galaxy Evolution Explorer), visible 
and NIR (Canada-France-Hawaii Telescope and Sub- 
aru Telescope), and mid-IR (Spitzer/IRAC) imaging 
(|Whitaker et aUl20ll . 

The catalog is complete at the 75% de tection level for 
magn itudes brighter than K$ = 23.1AB ([Whitake r et al.l 
120111) : we selected objects brighter than Kg = 23.4 mag, 
corresponding to a detection completeness of 50%. 

2.2. CANDELS 

CANDELS is a 902-orbit Hubble Space Telescope 
(HST) Multi-Cycle Treasury program aimed at probing 
the evolution of galaxies and black holes from z ~ 1.5 to 
8 and at detecting and studying Type la supernovae at 
z > 1.5 in order to better con strain the nature of dark 
energy (|Koekemoer et al.ll20TTI ). 

The Wide portion of the survey covers a total of ~ 800 
square arcminutes down to H ~ 26.5 mag, spread over 
five fields: extended regions around the t wo GOODS 
fields , the Exte nded Groth Strip (EGS: IDavis et all 
12007ft . COSMOS (IScoville et al.ll200"7ft . and the UKIDSS 
Ultra-Deep Survey fUDS: lLawrence et al.ll2007ft . 

We used the F160W as the filter for the measurement 
of the morphological parameters as it is the reddest 
band available with high-resolution imaging. At redshift 



be 



downloaded 



6 Catalogs can 

http://www.astro.yale.edu/nmbs/Data_Products.html 
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ID: 9083 
z: 2.823 
log[M*]: 11.160 
log[sSFR]: -9.000 
SFR: 144.544 
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Figure 1. Example of the need for high- resolution images such as those from CANDELS in identifying blended objects in lower resolution 
images. From left to right: a) Tile from the F160W CANDELS image; b) Segmentation map from NMBS (grey scale) and from the F160W 
CANDELS frame (colored spots); the purple cross indicates the center of the NMBS source, which is the center of the frame; the axis 
values are in arcseconds relative to the center of the tile; c) Residual image from GALFIT after fitting all sources simultaneously; the pixel 
scale for tiles a), b) and c) is the same, d) Observed SED from NMBS (black points with error bars) and SED fit from EAZY (solid red 
line). The blended objects like the one here plotted were excluded from the analysis. 




Figure 2. Observed SED from NMBS (black points), EAZY best- 
fitting SED (solid red curve) for a possible AGN. A color cutout 
created from the F814W, F125W and F160W CANDELS filters 
is shown in the top-left corner, while a cutout from CANDELS 
F160W image is presented at the bottom left. The angular size 
for both cutouts is 3.7" X 3. 7". Objects like the one plotted were 
excluded from the sample. 

z ~ 3, the F160W filter corresponds to the rest-frame 
B-band. 



2.3. Sample selection 

The Full Width Half Maximum (FWHM) of the Point- 
Spread Function (PSF) on the CANDELS F160W im - 
ages is FWHM- 0.17" - 0.19" (|Koekemoer et al.ll20Tl . 
a factor of -6 better than the fsT-band PSF in NMBS. 
This fact, together with the higher photometric depth 
compared to the NMBS survey, translates into a higher 
surface brightness sensitivity than NMBS. 

From the full NMBS catalog we extracted those non- 
stellar objects and objects with a good quality flag 
(star-flag=0 and use=l). We selected galaxies compat- 
ible with being at z > 3 at the 1-sigma level (i.e. u_68 
> 3.). In this way, we selected a total of 613 galaxies. 

The overlap of the CANDELS COSMOS image and the 
NMBS COSMOS field amounts to 192 square arcmin, 
- 20% of the original NMBS field. This reduces the 
sample of z > 3 galaxies available to the measurement of 
the morphological parameters to 133 sources. 



The availability of CANDELS data allowed us to ac- 
curately measure the morphological parameters of our 
z > 3 galaxies, and to identify potentially blended ob- 
jects, which appear as single sources in the NMBS cata- 
log. In Figure [T] we show an example of the importance 
of high-resolution imaging, which allowed us to identify 
those cases in which blended and/or very close objects 
could affect the measurement of the SED, with conse- 
quently unreliable photometric redshifts and stellar pop- 
ulation properties (e.g. stellar masses, SFRs). For these 
reasons, blended objects were excluded from our final 
analysis. We also visually inspected the 133 SEDs of the 
z > 3 galaxies and excluded from the sample those show- 
ing the presence of a possible AGN ( an example of which 
is presented in Figure [2]). The fraction of such objects in 
the whole sample of 133 galaxies summed to 4%. 

The final catalog includes 110 galaxies. The distribu- 
tion of M* with Zphot for the final sample is presented 
in Figure [3] The median redshift and redshift uncer- 
tainty for the full sample are ztot = 3.2 and 0.1 respec- 
tively; similarly, for the M* > lO n M0 subsample, we 
have Zm = 3.2 ± 0.3. The plot shows also that out of the 
10 objects with redshift compatible with z ~ 2.3, none 
of them has a stellar mass M* > IO^Mq, increasing our 
confidence in the adopted redshift selection criteria. 

2.4. Size measurement 

The luminosit y profile of each galaxy was fit by a sin- 
gle ISersid (119681) profile, using the GALFIT (jPeng et al.l 
120021 120101 ) program. This code fits two-dimensional an- 
alytic functions directly to images after convolving the 
profile with a PSF. All size measurements were done on 
the F160W CANDELS image. 

A zero-order set of m orphological parameters was ob- 
tained via SExtractor (|Bertin fc Arnoutsl 119961) on the 
F160W image. In order to make the measurements of the 
morphological parameters with GALFIT as robust and 
reliable as possible, two fundamental steps were taken 
into account. The first is the construction of the PSF, 
while the second is the evaluation of the sky background 
level. 

The PSF was constructed with IRAF/Daophot 
(|Stetsonlll987f) . from a set of unsaturated, bright and iso- 



4 



Stefanon et al. 



12.0 



11.5 



11.0 



V 10.5 




10.0 



Figure 3. Stellar mass as a function of redshift for the 110 galax- 
ies constituting our final sample (blue filled squares), after clean- 
ing for blended objects (grey filled circles) and possible AGN (red 
filled diamonds). The error bars in redshift encompass the 68% 
confidence level, as estimated by EAZY. The median redshift of 
M, > 10 11 Mq galaxies is 3.2, and it is marked by the vertical 
dashed line. All JW* > 10 Mq galaxies, selected to possibly lie at 
z > 3, have a redshift z > 2.5 at a 68% confidence level. This gives 
us confidence that our selection is not including z ~ 2 galaxies with 
very broad redshift probability distributions. 
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Figure 4. Specific star formation rate as a function of R e . Associ- 
ated errors on R e are from GALFIT. Points are color-coded accord- 
ing to their stellar mass. The sample defined by log(M* /Mq) > 11 
and marked in the plot by the large red filled squares, is complete 
in stellar mass at the 70% level. As such, this is the sample we 
adopt to draw our conclusions. The vertical grey region marks the 
limit t o which we c an resolve objects, determined by fitting stars 
with a Sersic ( 1968) profile. We note that, given the existence of 
a size-mass relation, the vertical line at R e = 2 kpc marking the 
separation between compact and extende d ga laxies only applies to 
log(M„/MQ) > 11 galaxies (see Section 13.21 for a more complete 
discussion). The inset shows the cumulative fraction of galaxies 
according to their effective radius R e for the full sample (solid 
black line), for the massive (log(M t /Mq) > 11) galaxies (dotted 
red line) and for the compact (R e < 2 kpc), massive, quiescent 
(log(sSFR,/yr _1 ) < —11) sample (dashed purple line). A vertical 
grey line marks our limit in R e for the compact galaxies. Ap- 
proximately 70% of the galaxies in our (flux-limited) sample have 
R e < 2 kpc; 4 out of the 10 massive galaxies are also compact. No 
extended quiescent galaxies are present, although this could be the 
result of a surface brightness effect. Star-forming galaxies show no 
evident correlation between R e and sSFR. 



lated stars, and using a gaussian analytic function plus 
a look-up table built by the IRAF psf task from the 
residuals of the function fitting. This allowed us to take 
into account the anisotropies of the brightness profile, 
especially those in the wings of the PSF. 

The SExtractor sky measurement is based on thresh- 
olding: the sky is measured by determining when the 
gradient of the flux flattens out enough compared to the 
background noise. In particular this means that SEx- 
tractor will generally over-predict the sky with the effect 
of suppressing the Sersic index, the effective radius, and 
luminosity. In our analysis, the background level was 
estimated from its median value in 30 non-overlapping 
boxes 50 pixels (3") wide, distributed across a region of 
400 pixels (24") around the central object and free from 
any other source, as probed by the SExtractor segmenta- 
tion map. The region size is wide enough to grant robust 
statistics and at the same time it is small enough to miti- 
gate possible gradients in the sky background. The boxes 
were kept to an additional distance of 0.6" from the seg- 
mentation map, increasing the confidence in excluding 
contamination from the outskirts of objects. 

All objects in a box of 150 pixels (9") around each 
galaxy were simultaneously fit, in order to take into ac- 
count possible contamination of the brightness profile 
from neighboring objects, which could bias the measured 
size and Sersic index. 

The effective radii were finally circularized and con- 
verted to physical units using the adopted cosmology. 

The resulting distribution of R e as a function of the 
sSFR for the galaxies of our sample are presented in Fig- 
ure m 

2.5. Stellar Mass Completeness 

Consistent statistical measurements of intrinsic physi- 
cal quantities rely on the accurate characterization of the 
selection effects, i.e. on the completeness of the sample. 

The galaxy stellar mass completeness for a flux-limited 
sample not only directly depends on the limiting flux it- 
self, but also on the mass-to-light ratio (M/L) of each 
galaxy. All else being the same, galaxies with lower 
M/L values will be probed to lower mass limits; simi- 
larly, it is possible to probe only the higher stellar mass 
region for those object with higher M/L ratios (see e.g. 
iMarchesini et al.ll2009l) . 

The above effect makes the measurement of the stellar- 
mass completeness a non trivial tas k; common ap- 
proac hes involve either SSP modelling (Dic kinson et al.l 
I2003D and/or co mparing the galaxy pop ulations to exist- 
ing deeper data ([Marchesini et al.ll2010l ). 

The average PSF on NMBS frames is ~ 1.1", corre- 
sponding to an R e of ~ 0.6", or ~5 kpc at z = 3.2. In 
particular this means that objects with R e < 5 kpc are 
essentially point sources on the NMBS frames. The in- 
set in Figure 0] shows that the totality of the galaxies 
in our sample has R e < 5 kpc. We therefore adopt, for 
each galaxy, the if —band completeness curve measured 
for point sources for t he NMBS survey and presented in 
IWhitaker etaLl (l2fjll . 

The completeness of the z > 3 sample as a function 
of stellar mass was analyzed for different ranges in M/L 
ratio. Objects with log(M*/M Q ) > 11 are complete at 
the ~ 70% level. A graphical representation is plotted in 
Figure OH which shows the log(M*/M Q ) vs. M/L ratio 



The Progenitors of z ~ 2 Compact, Massive, Quiescent Galaxies 



5 




jjj^v:--. ------ » 



□ □ □ □ □ 

<0.5 1.0 2.0 3.0 5.0 Kpc 
v No CANDELS data 

log(M,/M )>1 1 ■ 
10.5<log(M,/M o )<1 1 ■ - 
10.0<log(M,/M G )<10.5 ■ 
log(M,/M e )<10.0 ■ 



M/L [M s /L av ] 

Figure 5. Stellar mass as a function of the rest-frame V-band 
mass-to-light ratio for all the NMBS galaxies with redshift com- 
patible with being at z > 3. Points are color-coded according 
to their stellar mass. Objects covered by the CANDELS F160W 
frame which allowed for size measurement are marked by larger 
symbols, with the symbol size proportional to the R e . The 50% 
completeness level, c omputed adopting the if-band detection com- 
pleteness curve from [Whitakcr ct al. (2011), is marked by the yel- 
low dashed curve, while the 80% completeness level by the cyan 
dash-dotted curve. The vertical arrow marks the M/L of a single 
stellar population passively evolved to z = 3.2, smaller than the 
maximum value we observe for the massive population, increasing 
our confidence on our completeness analysis. The completeness of 
the M„ > 10 n iW" Q galaxies is > 70%. 



diagram for all NMBS galaxies at z > 3, as well as the 
50% and 80% completeness curves of the stellar mass as 
a function of M/L ratio derive d from the ff-b a nd de - 
tection completeness curve from IWhitaker et al.l (j2011). 
Figure [5] shows that for galaxies with \og(M*/Mo) > 11, 
the completeness is > 70%. Also plotted is the M/L at 
z = 3.2 of a passively evolving single stellar population 
formed at Z{ OTm = 20. Its value, equal to 0.78M©/L Qi y, 
well below the maximum value we observe for the massive 
sample, provides additional confidence in the adopted 
completeness limits in stellar mass. We note further- 
more that the completeness in stellar mass as a function 
of redshift is fairly flat over the redshift range targeted 
in our work, changing by, e.g., ~ 0.1 dex from z = 4 to 
z = 3 for a single stellar population formed at Z{ olm = 10. 
In our analysis we then selected only those galaxies with 
log(M*/M ) > 11 (for a total of 10 objects), for which 
small completeness corrections are required; specifically, 
the correction for incompleteness ranged from 1.05 to 1.5. 

In subsequent sections we will evolve our z ~ 3 sample 
to z ~ 2 to assess its correspondence to the z ~ 2 ob- 
served population. In principle we should also estimate 
completeness levels in stellar mass for this evolved popu- 
lation. This could be done in a way similar to that imple- 
mented for the z > 3 sample. In particular, this means 
that, for each object, the final completeness correction 
for the z ~ 2 sample would be the product between the 
completeness correcti on at z ~ 3 and that at 2 ~ 2. 
However, as shown bv lBrammer et ahl (| 2 1 If ) . the stellar 
mass completeness at z ~ 2 is 95% or better for galaxies 
with stellar mass log(M«) > 11, which coincides with our 
stellar mass selection limit. This implies that no further 
correction is needed for the massive z ~ 2 sample. 



2.6. Cosmic Variance 

Given the small region of overlap between the NMBS- 
COSMOS and CANDELS-COSMOS fields, the statisti- 
cal uncertainties associated with fluctuations of the large- 
scale density (i.e. the cosmic variance) are playing a 
non negligible role. We measured such effects following 
the recipe by Moster c t al.1 (|2011| ). A halo distribution 
model is used to relate the stellar mass to the dark mat- 
ter halo as a function of redshift; the galaxy bias is then 
estimated via dissipationless N-body simulations. The 
cosmic variance is first computed on dark matter haloes, 
and then converted to galaxy cosmic variance by apply- 
ing the galaxy bias. The average relative error due to 
cosmic variance for the z > 3 massive sample is 35%. 
This value was added in quadrature to the Poisson er- 
rors in the number densities we computed. 

2.7. The sample 

Our primary sample is composed by those galaxies 
brighter than Ks = 23.4AB, whose redshift is compatible 
with z > 3, with stellar mass M* > 10 "Mo, and which 
lie on the CANDELS-COSMOS frame. This selection 
yields 10 galaxies. Given that their associated complete- 
ness is ~ 70%, this is the only sample which allows us 
to perform a quantitatively robust analysis. Specifically, 
all number densities quoted in this work are based on 
the above sample. Their SEDs are shown in Figure |H1 
together with the F160W and color cutout. 

3. RESULTS 
3.1. Massive z > 3 galaxies 

The distribution of the specific star formation rate 
(sSFR=SFR/M*) as a function of the effective radius R e 
for the 3 < z < 4 sample is shown in Figure [4] Points are 
color-coded according to their stellar mass. The effective 
radius from a Sersic profile fit to the point-source objects 
is 0.11 ± 0.01"; its projection to 3 < z < 4 corresponds 
to ~ 0.8 kpc and is marked by the solid grey region, 
identifying a limit for the PSF-convolved Sersic profile. 
This marks an empirically determined limit below which 
we cannot robustly determine the size. Given our stellar 
mass completeness measurement, galaxies satisfying the 
70% completeness limit coincide with those whose stellar 
mass M* > lO n M0, identified by the red squares. 

In the following, if not otherwise specified, we refer 
to massive galaxies as those with stellar mass M* > 
lO n M0; we define compact galaxies as those with 
R e < 2 kpc and quiescent galaxies as those whose 
sSFR< 10~ n yr -1 . Our de finition of q u iescen t galax- 
ies is based on the work of iKriek et al.1 (|2006l ). where 
a population of massive quiescent galaxies is spectro- 
scopically identified at z ~ 2.3. In their sample, 



the average sSFR is 4 x 10 



yr 



with all galax- 



ies having log(sSFR/yr ) < —11. For galaxies of 
log(M*/M ) = 11, this corresponds to SFR< 1M 



While galaxies with sSFR> 10 10 yr 1 can be 



reasonably considered star-forming, those with interme- 
diate sSFR (-11 < log(sSFR/yr~ 1 ) < -10) are galax- 
ies with suppressed star-formation activities (relative to 
their stellar masses) with respect to the star-forming se- 
quence observed at z ~ 2 (e.g. IWhitaker et al.l 120121 : 
ISzomoru et al.|[2012T ). although not necessarily quiescent. 
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Figure 6. Observed SED from NMBS (black points) together with EAZY best-fitting SED (solid red curve) of our sample of galaxies 
with log(M*M ) > 11 A color cutout cen tered on the object and based on the F814W, F125W and F160W filters from CANDELS, with 
color scheme following Lupton ct al. (2004), is presented in the upper part of each plot, while a cutout from the F160W filter is shown at 
the bottom of each plot is. The MIPS det. label indicates those objects with a 3 — cr MIPS detection (see text for details). The angular 
size of each cutout is 3.7" X 3.7". 



We therefore adopt a more conservative definition of 
quiescent galaxy (i.e., sSFR< 10~ n yr _1 ) in place of 
sSFR< 10~ 10 yr , sometimes used in the literature. 
We also checked Spitzer/MIPS 24 /im data relative to 
each massive galaxies. All but one star-forming galax- 
ies are detected at more than 3a. No significant MIPS 
detection is found for 2 of the 4 massive quiescent galax- 
ies, consistently with their quiescent nature from SED 
modeling. However, two quiescent massive galaxies have 
significant MIPS fluxes (~ 80 and ~ 140^ Jy), which 
would imply large sta r formation rates (700-900 Moyr" 1 ; 
iWhitaker et al.ll2012D if the 24/xm emission were associ- 
ated to dust-enshrouded star formation. We note how- 
ever that at z > 3, the MIPS 24/im band samples rest- 
frame wavelengths shorter than 6/im, i.e., emission from 



hot dust. As such, the conversion from mid-IR fluxes to 
SFRs is very uncertain, especially at the high redshifts 
targeted in our work. Moreover, additional contamina- 
tion due to the emission from the dusty torus of an AGN 
can potentially contaminate the MIR fluxes probed by 
the MIPS 24/im band. For these reasons, we will use 
the SFRs derived from the SED modeling throughout 
the paper, noting that half of the sub-sample of mas- 
sive quiescent galaxies could actually be highly obscured, 
star- forming galaxies. Observations in the far-IR (e.g., 
ALMA) are needed to robustly quantify the level of ob- 
scured star formation and to confirm the quiescent nature 
of these galaxies. 

The data show that at z > 3, M* > 10 11 M Q galaxies 
that are quiescent tend to be compact, while those that 
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are star-forming are more extended, similarly to the dis- 
tribution found at z ~ 2. Most of the galaxies (~ 70% - 
see the inset in Fig. [4j are characterized by R e < 2 kpc, 
with a median of 1.5 kpc. However we caution that these 
values refer to the full sample which is not complete in 
stellar mass. The average size for the MQGs in the stel- 
lar mass complete sample is 1.2 kpc (0.6 kpc pc if the two 
galaxies with MIPS 24 /im detection are excluded); the 
corresponding value for the massive, star-forming sample 
is 3.1 kpc. 

The plot also shows a lack of extended (R e > 2 kpc) 
quiescent galaxies, indicating that the massive quies- 
cent (elliptical) galaxies observed in the Local Universe 
were not yet completely formed when the Universe was 
< 2 Gyr old. This could also be a surface bright- 
ness selection effect, although the width of NMBS PSF 
compared to CANDELS PSF causes all objects with 
i? e < 5 kpc to be detected as point-sources; the selec- 
tion effect should then act only on the very extended 
objects. 

The plot in Figure [4] shows that MQGs were already 
present at z > 3, con firming previous results (see e.g. 
iMarchesini et a l. 2010 which first characterized the prop- 
erties of a mass-complete sample of 3 < z < 4 galax- 
ies, finding both quiescent and star- forming galaxies). 
For M* > 10 n Af Q and sSFR < 10~ n yr" 1 we count 
four galaxies, corresponding to a completeness-corrected 
number density of uq — 5.2^f^ x 10~ 6 Mpc~ 3 (uq = 
3.6±f° 4 x 10~ 6 Mpc" 3 if the two galaxies with MIPS 24 
fim detection are excluded) . The co- moving volume was 
computed assuming a redshift range 2.8 < z < 4.0. Up- 
per and l ower error bars were computed following the 
recipe by iGehrelsl (|1986h : cosmic variance was finally 
added in quadrature. 

Figure 0] clearly shows that compact, MQGs were al- 
ready present at z > 3. This is one of the main re- 
sults of this work. Studies so far have found this popu- 
lation of galaxies up to z ~ 2 (see e.g. iKriek et"al"]l2006l : 
iBezanson et al.ll2009H . Our data allows us to push back 
in time the appearance of compact, massive, quiescent 
galaxies from when the Universe was w 3 Gyr old to an 
age of less than 2 Gyr. 

As we will show in Sec. 14.31 these objects do not exist 
in significant enough numbers to explain the abundance 
of compact, massive, quiescent galaxies (CMQGs) at z ~ 
2. 

3.2. Compact, star-forming galaxies at z ~ 3 with large 
stellar masses 

The exis tence of a relati o n between AL an d R e 
(see e.g. iShen et all [200l iMosleh et all l20TTh im- 
plies that we can not consider a single value for the 
R e when selecting compact galaxies at different stel- 
lar mass ranges. Assuming R P tx Af°' 32 , valid for 
galaxies at 2.5 < z < 3.5, (jMosleh et all 1201 ll) . an 
R e = 2.0 kpc for a log(AL/Af ) = 11 galaxy scales to 
R e = 1.4 kpc for log(AL/Af ) = 10.5 and R e = 1.0 kpc 
for log(AL/M ) = 10.0. Figure H then shows that there 
are a number of compact star-forming galaxies, span- 
ning the whole range of stellar masses. In particular, and 
more interestingly, in our sample there are 5 galaxies with 

high sSFR (log(sSFR/yr ) 9), high stellar masses 

(10.6 < log(AL/Af ) < 11) and with R e < 1.4 kpc (i.e. 



they are compact). Their redshift is above z = 2.6 with 
68% confidence level, excluding contamination from low 
redshift galaxies with broad probability distributions of 
photometric redshifts. From Figure [5l the stellar mass 
completeness of log(AL/Af ) ~ 10.6 galaxies is lower 
than 50% for M/L > O.9M /L 0! y, i.e. we could possi- 
bly be missing star-forming galaxies with significant dust 
absorption. In Figure[7]we present their SEDs along with 
F160W and color cutouts. We note however, that one out 
of the 5 objects has both 5 and 8 /jm excess that could 
be the imprint of a Type 2 AGN. However, there is no 
evidence for this AGN in the rest-frame optical SED and 
we therefore do not think that it is causing a significant 
bias in our sizes, although obscured AGN are a potential 
source of uncertainty. 

To determine if there is a faint extended component in 
the compact star-forming galaxies and to better assess 
our size measurements, we directly stacked the images 
for the 5 objects. We normalized each image tile to its 
peak flux to prevent any single object from dominating 
the stack. The brightness profile of the stacked image was 
analyzed using GALFIT and a Sersic profile, obtaining 
R e = 0.53" ±0.02" and n = 3.5 ±0.2. Assuming an aver- 
age redshift of z ~ 3, this corresponds to R e ~ 7.7 kpc. 
However, we note that visual inspection of the images of 
the individual objects shows that the aspect ratio and 
position angle of two galaxies are very different from the 
other three, causing significant broadening of the profile. 
We therefore repeated the stacking excluding these two 
sources, obtaining R e = 0.12" ± 0.01" (0.9 kpc at z ~ 3) 
and n = 1.7 ±0.2, further supporting their compact con- 
figuration. The stacked image and its surface brightness 
profile for this second analysis are shown in Figure [51 

These compact, star-forming galaxies with high stellar 
mass are likely candidates for being the progenitors of the 
compact, MQGs observed at z ~ 2. However, as we will 
show in the next sections, the number density associated 
to this sample of objects cannot explain by itself the 
number densities observed for the compact, MQGs at 
z ~ 2. 

4. WHAT WERE THE PROGENITORS OF THE Z ~ 2.3 
COMPACT, MASSIVE, AND QUIESCENT GALAXIES? 

4.1. Evolving z > 3 galaxies to z = 2.3 

In this section we describe the evolution of the z > 3 
population of massive (AL > lO u Af ) galaxies down to 
z = 2.3, paying specific attention to the progenitor pop- 
ulation of the mas sive, quiescent (sSFR < 10~ n yr" 1 ) 
galaxies at z ~ 2.3 (IKriek et alj|2006t Ivan Dokkum et al.l 
l2008t IBezanson et al.ll2009D . We used the stellar masses 
and SFRs from the public NMBS c atalog. The s e wer e 
computed using th e FAS T code (jKriek et al.l I2009T ). 
adopting a iKroupal ()2001h IMF, a delayed-exponential 
star formation history (SFH) and solar metallicity. 

The stellar mass and the SFR of each z > 3 galaxy 
was evol ved to z = 2.3 using the GALAXEV program 
from the lBruzual & Charlotil200l SSP models using a 
set of different SFHs. Specifically we adopted 5 distinct 
SFHs: a constant SFR (CSF), an exponentially declining 
SFR (E), an exponentially declining with quenching of 
star formation lOOMyr after the observed redshift SFR 
(TE) , a delayed-exponential SFR (DE) , and its quenched 
version lOOMyr after the observed redshift (TDE). We 
allowed each SFH to progress starting at the mass and 



Stefanon et al. 



z:2.9 

Re [kpc]:0.77 
log(M./M o ):l0.7 
log(sSFR/yr-'):-8.9 
V'-5 




z:3.1 

Re [kpc]:0.99 
log(M./M e ):1 1.0 
log(sSFR/yr-'):-9.J 
A,:0.9 



A [fi m] 




z:3.0 

Re [kpc]:1.04 
log(M./M o ):10.9 
log(sSFR/yr _, ):-9.5 
\:0A 




z:2.9 

Re [kpc]:1.08 

log(M./M o ):10.6 

log(sSFR/yr"'):-S.6 



A [fj, m] 




z:3.0 

Re [kpc]:1.15 
log(M./M o ):10.6 
log(sSFR/yr-'):-8.7 
V'.2 




Figure 7. Observed SED from NMBS (black points), EAZY best-fitting SED (solid red curve), color cutout built from F814W, F125W 
and F160W CANDELS frames and F160W cutout for our sample of compact, star-forming galaxies with 10.6 < log M^/Mq < 11.0. The 
angular size of each cutout is 3.7" X 3.7". 



SFR given by FAST. The adopted SFH are schematically 
presented in Figure |9] 

The z > 3 sample and its evolution at z — 2.3 are 
shown in the sSFR-M* plane in the four panels of Figure 
[TOl (we omit the truncated delayed-exponential case as 
its results resemble those from the truncated exponential 
SFH). 

The panels in Figure [TO] show that galaxies with 
sSFR< 10~ 9 yr -1 do not significantly increase their 
stellar mass with time due to their low SFRs, but 



instead they keep approximately the same value, or 
slightly decrease it, due to the return of stellar mass to 
the gas phase in the ISM. The largest contribution to 
the change in number density in the evolution of the 
population of MQGs comes mainly from the decreased 
value of the SFR at z = 2.3, rather than from an 
increase in stellar mass with cosmic time. 

In the following section, we present the number den- 
sities of massive, quiescent galaxies; the number densi- 
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Figure 8. The panels on the left show the stacked image for 
three out of the five compact star- forming galaxies with high stellar 
mass at z > 3, obtained after excluding the two galaxies with very 
different aspect ratio and position angle (top) and the residual 
image resulting from the fit (bottom). The panels on the right 
show on top the circularized radial profile (arbitrary units) of the 
stacked image indicated by the black points. The triple-dotted 
dashed red curve marks the PSF-convolved Sersic profile recovered 
by GALFIT, while the blue dashed line represents the profile of the 
PSF. The bottom panel shows the residuals measured directly from 
the GALFIT residual image (black points) and smoothed difference 
between the PSF-convolvcd analytic profile and the original input 
image (solid red line). The light profile closely resembles that of 
the PSF and it does not show evidence for a faint extended halo. 
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Figure 9. Examples of the SFHs resulting from the different evo- 
lution scenarios assumed in this work. Left panel: From top to 
bottom, are the delayed-cxponential SFH, the constant SFH and 
the truncated delayed-cxponential SFH. Right panel: from top to 
bottom, the exponential SFH and the truncated exponential SFH. 
The adopted parameters are representative of the average z > 3 
population for all but the truncated-exponential SFH: e-folding 
time (t) of 8 X 10 7 yr, and age of 3 X 10 s yr. The age of the 
truncated-exponential SFH corresponds to 2 X 10 7 yr. Arbitrary 
normalization and offset were applied in order to increase readabil- 
ity. 

ties of the compact, massive, quiescent population is dis- 
cussed in Section 14.31 As a further and more consistent 
source of comparison, the number density of z > 3 mas- 
sive galaxies and of z = 2.3 massive, quiescent galaxies 
was also computed using the full NMBS catalog. These 
two populations were selected according to the same lim- 
its in stellar mass and SFR used for the other samples, 
i.e. log(M*/M Q ) > 11 and log(sSFR/y r - 1 )< -11. The 
full list of the number densities computed for the differ- 



ent choices of SFH can be found in Table [TJ The co- 
moving volume was computed assuming a redshift range 
2.8 < z < 4 . 0. W e used upper and lower error bars 
from IGehr cls (1986;) and added in quadrature the cos- 
mic variance. Figure [3] shows that there are 6 objects 
with z < 2.8, although all of them have stellar masses 
ill* < 10 11 M Q . We repeated our analysis excluding the 
6 objects with z < 2.8 and obtained results quantita- 
tively similar to those presented here. 

4.2. The evolution of massive z > 3 galaxies to z ~ 2 

In order to explore how our results depend on different 
SFH in SED modelling, we adopt a different SFH for the 
past and future history of galaxies in each panel of Figure 
1101 Specifically, for the exponential and truncated expo- 
nential SFH, we used stellar masses and SFRs com puted 
adopting an exponential SFR and a lSalpeterl ([1955) IMF, 
and converting the SFRs and stellar masses to a Kroupa 
IMF by subtracting 0.2 dex. The location of the start- 
ing galaxies does not depend much on the SFH used to 
compute the stellar masses and SFRs, meaning that the 
choice of SFH primarily effects the evolved galaxies. 

The number density tim of massive (M* > 10 n M©) 
z > 3 galaxies from our sample is Um — 14.9j"g'| x 
10 -6 Mpc~ 3 ; Figure [TT] shows that it is fully compat- 
ible with the num ber density of z y 2.2 — 2.5 quies- 
cent galaxies from IBrammer et al.l (|2011f) . In partic- 
ular, this implies that the whole population of mas- 
sive z > 3 galaxies needs to have its SFR quenched to 
log(sSFR/yr _1 ) < -11 by z ~ 2. 

The comparison of the number density of z > 3 mas- 
sive galaxies with z = 2.3 massive, quiescent galaxies, 
both computed using the full NMBS catalog, confirms 
the above result. 

This is further supported by our SFH predictions. The 
blue bar in the left panel of Figure [TT] represents the 
number density of galaxies which, evolved from their ob- 
served redshifts, are massive and quiescent by z = 2.3. 
The box takes into account the spread in values from 4 
of the 5 adopted SFHs (excluding the CSF), and cos- 
mic variance. The blue box is in good agreement with 
measu rements of quiescent galaxies from IBrammer et al.l 
(|2011| ) at z ~ 2 — 2.5. Over-plotted are also the val- 
ues from the individual SFHs. In general, apart from 
the constant SFH case, it is not possible to discriminate 
amongst the SFHs, mainly due to the high cosmic vari- 
ance errors (« 0.35 relative error). As shown in the left 
panel of Figure [UJ 4 out of 5 of the SFHs do equally well 
in reproducing the number counts of MQGs at z ~ 2.3. 
The only exception is the CSF that underpredicts the 
observed number density of MQGs at z ~ 2 by a fac- 
tor of ~ 4. This suggests that the only valid SFHs are 
the declining or truncated SFHs and that the constant 
SFH is not a valid option in this range of redshift. On 
the other side, given that the number densities of mas- 
sive, star-forming galaxi es at z ~ 2 is approxim ately the 
same as that of MQGs {B rammer et alj|20111) . the fact 
that the CSF SFH underpredicts the number densities of 
MQG means that the CSF SFH would underpredict also 
the number density for the massive star-forming galaxies 
at z ~ 2. This may suggest that high-redshift massive 
star-forming galaxies are character ized by rising SFHs, in 
agreement with recent works (e.g. iMaraston et al J 120101 : 
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Figure 10. Evolution of the z > 3 galaxy sample in the sSFR-Af* plane to z = 2.3 for 4 different SFHs (left to right, top to bottom: 
constant, exponential, delayed-exponential and truncated exponential SFHs). Red big squares mark compact (i.e. R e < 2 kpc) z > 3 
galaxies, while blue smaller squares indicate the remaining z > 3 galaxies. The position of each galaxy after evolution is indicated by 
the green triangles and yellow circles for descendants of compact and extended galaxies, respectively. The filled area marks the selection 
criteria adopted for the massive (M, > IO^Mq) and quiescent (sSFR < f0~ n yr — 1 ) sample at z = 2.3. The vertical dashed line indicates 
the stellar mass limit corresponding to our 70% completeness. Barring significant size evolution, the descendants of the red squares that 
fall within the shaded box tell us the number of predicted z ~ 2 compact, massive, quiescent galaxies. The number of galaxies in the 
shaded box is possibly a lower limit as they are the result of evolving the full z > 3 sample and not only those galaxies from the stella-mass 
complete sample. In particular, there may be galaxies with stellar masses below our completeness limit at z ~ 3 which could nonetheless 
grow above the limit by z ~ 2.3. The largest contribution to the change in number density in the evolution of the population of massive 
quiescent galaxies comes mainly from the decreased value of the SFR at z = 2.3, rather than from an increase in stellar mass with cosmic 
time. 



iPapovich et"aLll20TTI). 

Interestingly. iBell et al.l ([201 ID found a similar result 
when comparing z ~ 1 galaxies to the z — quiescent 
population, with the local population of MQGs approxi- 
mately as numerous as the entire massive population at 
z ~ 1. This could suggest that mechanisms for quench- 
ing, either the same over the history of the Universe or 
of different nature at different epochs, are ongoing more 
or less continuously since z ~ 4. 

4.3. The progenitors of z ~ 2 compact, massive, 
quiescent galaxies 

Several works have demonstrated that the vast major- 
ity o f massive quiescent z ~ 2 galaxies are also compact 
(see[S zomoru et al.l [20121 and references therein). This 
allows us to directly compare our number densities for 
compact, massive, quiesc ent galaxies to the num ber den- 
si ties at z = 2 — 2 . 5 fromlBrammer et al.l (| 2 1 If ) . 

iBezanson et al.l (|2009t ) published an estimate of the 
number density of compact, massive, q uiescent galaxies, 
based on the stellar mass function of iMarchesin i et al.l 



( 2009) . The corresponding value is marked in Figure [TT] 
by the crossed-circle symbol. We increased the original 
error bar to ta ke into account the effects of cosmic vari- 
ance using the iMoster et al.l ([201 ll ) recipe, and adding it 
in quadrature. The final error bar is a factor of « 1.5 
the error bar quoted in the original work, but more rep- 
resentative of the true uncertainties. 

As is seen in the right panel of Figure 111! the num- 
ber density nc of massive galaxies that are compact at 
z > 3 (which also corresponds to the number density 
riQ of massive quiescent galaxies at z > 3) is a fac- 
tor of ~ 3 smaller compared to the number density of 
all massive galaxies. Their evolution to z — 2.3 using 
our chosen SFHs (blue box) produces a number den- 
sity which is approximately a factor of > 4 smaller than 
the number density o f massive quiescent galaxies from 
iBrammer et al.l ([20111 ). and consistent only at a 2a level. 
We can explain the decrease in number density between 
z ~ 3 and z <~ 2 in the context of our evolution mod- 
els: the increase with cosmic time of the fraction of mass 
returned to the ISM decreases the stellar mass. One 
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Figure 11. Left panel: number density of the massive z > 3 sample with overlap with the CANDELS field (blue open circle), and 
computed from the full NMBS catalog (red open circle). The filled r ed square marks the n umber density of z ~ 2.3 massive quiescent 
galaxies computed using the full NMBS catalog. We note that, while Brammer ct al. (2011) number density refers to quiescent galaxies 
selected via the UVJ diagram, our measurement of the z ~ 2.3 massive quiescent population is obtained applying a sSFR< 10 — 11 yr _1 
cut. The blue hashed box at z ~ 2.3 encompasses the range in number density of massive quiescent galaxies as predicted by four of our 
SFH scenarios (DE, TDE, E, and TE) for evolving all galaxies from z > 3 to lower redshift. The vertical extent of the bar also takes into 
account the expected cosmic variance. It is clear from this that there are enough massive galaxies at z > 3 to account for the full massive 
quiescent galaxy population at z ~ 2. The blue filled circle at z = 2.3 represents the number density and total error from the CSF model. 
The total number density of log(M*/Mo) > 11 galaxies at z > 3 is consistent with the population of massive quiescent galaxies at z ~ 2; 
all except the CSFH are plausible SFHs in reproducing the evolution to z ~ 2. Right panel: number density of the original massive z > 3 
sample (open grey and magenta circles for the measurement obtained using our sample and the full NMBS catalog, respectively) and of the 
compact massive z > 3 sample (thick blue plus), which encompasses the same objects as the z > 3 MQ sample, and hence has the same 
number density. The colored boxes at z ss 2.3 mark the number densities of evolved compact, massive, quiescent galaxies under different 
scenarios. The blue box represents pure evolution in the SFHs with no merging. The red box and points represent the evolution after 
galaxies have been allowed to merge once randomly. The green box includes the effect of the SFH, a random merger, and a post-merger 
truncation of the SFH. The blue and the green b ars and the associated points have been shifted by an arbitrary value in z to improve 
readability and should be considered at z = 2.3. Brammer ct al. (2011) data is plotted only for z > 2, i.e. where the vast majority of 
massive quiescent galaxies are also compact. Also plotted is the number density of 2.2 < z < 2.4 massive, quiescent galaxies from the 
full NMBS catalog (red filled square). The number density of the compact, massive, quiescent population of galaxies obtained with pure 
SFH evolution to z = 2.3 is only marginally consistent with previous determinations; a simple merging model produces number densities in 
better agreement with previous determinations. Agreement at the 1 — a level is found with the number density of z ~ 2 massive, quiescent 
galaxies computed from the full NMBS data set for the simple evolution scenario; the simple merging model increases the level of agreement 
of the number densities. We caution however that the number densities of the evolved samples are perhaps lower limits since they are the 
result of evolving all galaxies at z > 3 and not just those for which we are mass complete. Specifically, there may be galaxies below our 
completeness limit at z ~ 3 which could nonetheless grow above the limit by z ~ 2.3. 

possible source of th e discrepancy be t ween our measured 
number density and lBrammer et"aLl (1201 lh could be the 
different criteria adopted by Brammer to se- 

lect the quiescent po pulation (the UVJ c olor selection - 
iWilliams et~aTl I2009D . In a recent work, iSzomoru et al.l 
( 20121) showed that the fraction of quiescent galaxies se- 
lected at z > 2 using the UVJ method is equivalent to 
a sSFR< 10 _10 yr~ 1 criterieQ. It is worth noting that 
we do not find z > 3 compact, massive, star-forming 
galaxies, and specifically compact, massive galaxies with 
— 11 < log(sSFR/yr _1 ) < —10 which, following our 
evolutionary models, would eventually become compact, 
massive, quiescent by z ~ 2. This means that our mea- 
surement of the number density of compact, massive, qui- 
escent galaxies at z ~ 2.3 does not change if we use a 
sSFR that is compatible with that for a UVJ cut. 

When we compare the expected number density of 
CMQ galaxies that have been evolved from z ~ 3 with 
the measured sample of z — 2.3 MQ galaxies from NMBS 
(nearly all of which are compact), we find that the two 



7 We also selected quiescent galaxies in the full NMBS data 
set using the UVJ color-color technique. The sSFRs for quiescent 
galaxies at 2 < z < 2.5 and with log(M*/M@) > 11 are lower than 
log(sSFR/yr- 1 )= -9.84, with a 75% upper limit log(sSFR/yr _1 )= 
-10.3 and a median log(sSFR/yr _1 )= -11.11 



agree to within one sigma. 

The plot sho w s also that the number density from 
IBezanson et al.l ([2009) is higher than our measure- 
ments, and compatibl e only with the upper end of the 
iBrammer et al.l ( 20111) . One possible rea s on for this dis- 
crepancy could be that IBezanson et al.l (|2009ft assume 
the fraction of massive, quiesc ent galaxies to be 0.5, likely 
a too optimistic choice (e.g. IDommguez Sanchez et al.l 

unni). 

Similar to the left panel, in the right panel of Figure 
1111 the number densities from the individual SFHs are 
marked. For the compact massive galaxies, the number 
density from the CSFH is compatible with the measure- 
ments from th e other SFHs, in agreement with recen t 
works (see e.g. iGonzalez et aD201tt iReddy et al.ll2012l ). 

4.3.1. The effects of mergers 

In order to try to understand the possible mechanisms 
which could be responsible for the build up of the com- 
pact, massive, quiescent population at z ~ 2, we imple- 
mented a very simple statistical model for how mergers 
could affect the evolution in the observed number densi- 
ties. Our toy model consists of randomly choosing pairs 
of galaxies among the 110 galaxies with z > 3 and with 
a measurement of the R e and leaving the SFH of each 
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Table 1 

Number densities for the galaxy populations and SFHs adopted in this work. 
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(a) Symbols refer to: M = massive galaxies (log(M„/A/Q) > 11), C = compact galaxies (R, 
(log(sSFR/yr _1 ) < — 11) and combinations thereof. 

(b) The co-moving volume for the z > 3 and z = 2.3 populations is computed adopting the redshift range 2 
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galaxy to evolve independently of its companion. The 
stellar mass of the merged pair was finally considered as 
the algebraic sum of the two components; the sSFR was 
computed as the mass-weighted sum of each component 
(or, equivalently, as the sum of the two SFRs divided by 
the total stellar mass), while we assumed for the R e of 
the merged pair the R e of the more massive companion. 

The effect of merging on the number density evolution 
are represented in the right panel of Figure [TT] by the 
red box; the result is an agreement between the observed 
number density of compact MQGs at z ~ 2 and that 
from our models, for all the SFH but the CSF. In this 
case, in fact, the number density is even smaller than the 
number density of the population without merging. In 
the framework of our toy model, this is due to the high 
probability that each pair contains a galaxy with high 
SFR. The end product is then biased towards high sSFR 
galaxies at the end of the merging process. 

Given the potential effect of a starburst or AGN that is 
triggered by the merger, we also examined a scenario in 
which the SF is quenched following the merger. This 
is supported by recent hydrodynamical simulations in 
which mergers play an important role in the gravitational 
heating of the halo gas, and consequently in th e suppres- 
sion of star formation ( Johansso n et aLl l^OOQ). However 
we would like to note that this last process does not 
improve the agreement significantly, except for the CSF 
case, for which the number density falls well within the 
values from the other SFH. 

Good agreement between the expected and observed 
number densities for our evolved, merged, and quenched 
model is also found when we compare the expected 
number densities to those calculated from the full 
NMBS sample using a log^SFR/yr^ 1 ) < -11 cut. This 
comparison is useful as the full NMBS catalog has a 
higher precision than our sample that is limited to the 
area with CANDELS overlap. Although the merging 
model appears to be in better agreement with the data, 
it is clear that even a simple evolution model with 
no merging provides an adequate match to the data. 
Therefore, merging and quenching, which are both 



likely processes that are occurring in the galaxy pop- 
ulation, are not required to explain the number densities. 

If the two quiescent galaxies with MIPS detections in- 
deed have the extreme SFRs implied by the rest-frame 
5.6 /im fluxes, our results on the potential importance of 
merging would remain qualitatively unchanged. This is 
because these two galaxies are only 13-20% of the whole 
sample of galaxies that would be classified as massive 
at z ~ 2.3. If the truncated post-merger SFH is correct 
then the results will also remain quantitatively similar as 
the prompt quenching of star formation assumed in this 
model would turn them into massive quiescent galaxies 
by z ~ 2.3. 

5. DISCUSSION 

According to current semi-analytic models for the for- 
mation of massive quiescent galaxies, the bulk of the stel- 
lar mass was formed as a compact, massive spheroid at 
z > 3 through gas-rich merger events (e.g. lOser et ail 
l2010f ). Specifically, in the redshift range between z = 5 
and z = 3, the models predict that the central galaxy 
would still be building up from gas flows which would 
feed the the formation of stars in the central region 
of the g alaxy directly, for ming the concentrated stellar 
system (|Naab et al.ll2009f h In a second stage, quench- 
ing mechanisms such as major merger or feedback from 
AGN or from star formation would convert the full pop- 
ulation of massive galaxies into the population of qui- 
escent massive galaxies observed at later cosmic times. 
Subsequent minor d ry mergers would be responsible fo r 
the increase in size (|Naab et all 120091: IQser etldl 121)121) . 
while keeping the gain in stellar mass to a factor of < 2 
(|van Dokkuni et al.|[2010l) . 

Understanding what are the physical mechanisms of 
star-formation quenching, when they started to act and 
how long they took to completely quench star-formation 
is then a central key in our knowledge of formation and 
evolution of local massive quiescent galaxies. While our 
analysis does not provide us with significant information 
on the physical mechanisms, it allows us to put new con- 
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straints on both when star-formation quenching could 
have happened and on how fast it could be. 

In fact, the compatibility of the number density of mas- 
sive z > 3 galaxies with the number density of z ~ 2 mas- 
sive quiescent galaxies from the literature suggests that 
the population of massive quiescent galaxies at z ~ 2 can 
be completely accounted for by the observed z > 3 popu- 
lation of massive galaxies that is subsequently quenched. 
Specifically, when evolving our sample of z > 3 galax- 
ies to z — 2.3, a good agreement is obtained introducing 
quenching of star-formation after merging, process which 
is supported by recent hydrodynamical simulations. 

Secondly, our observations of compact, massive, quies- 
cent galaxies at 3 < z < 4 push back in time the appear- 
ance of this class of objects to when the Universe was 
- 2 Gyr old. 

Finally, considering that, according to models, mas- 
sive galaxies should still be actively forming stars at 
3 < z < 5, then the observation of compact massive 
galaxies which are already quiescent by 3 < z < 4 
imposes that the quenching of star formation should 
be a rapid mechanism in massive galaxies, acting on 
timescales of less than 1 Gyr in the early Universe. 

6. CONCLUSIONS 

In this work we used two overlapping public sets of data 
in a region of the COSMOS field to identify the progeni- 
tors at z > 3 of the compact, massive, quiescent galaxies 
observed at z ~ 2. Stellar masses, sSFRs and photomet- 
ric redshifts were taken from the NMBS. The sizes were 
measured on high-resolution CANDELS F160W images. 
The population of z > 3 galaxi es was evolved to z = 2.3 
using Bruz ual fc Chariot! ()2003[ ) models following five dif- 
ferent SFHs (constant, delayed-exponential, truncated 
delayed-exponcntial, exponential and truncated expo- 
nential) . All the number densities were computed assum- 
ing a co- moving volume corresponding to 2.8 < z < 4.0. 
Our main results can be summarized by the following 
points: 

1. We discovered four compact, massive 
(M* > 10 n M Q ) quiescent (SED-based 
sSFR< 1CT 11 yr" 1 ) galaxies at z > 3, corre- 
sponding to a completeness-corrected number 
density of n Q = 5.2±|;^ x 1(T 6 Mpc~ 3 . If the two 
galaxies with MIPS detection were excluded, the 
resulting completeness-corrected number density 
would be n Q = 3.6±2.4 x 10~ 6 Mpc~ 3 . 

2. For a complete sample of 10 galaxies with 
log M*/Mq > 11, we found that the quiescent 
(sSFR< 10~ n yr _1 ) galaxies are compact (R e ~ 
1.2 kpc), while the star-forming galaxies are ex- 
tended (i? e ~ 3.1 kpc), qualitatively similar to 
what is found at lower redshifts (z ~ 2.3). If the 
two quiescent galaxies with MIPS 24 /xm detection 
were considered star forming, the averaged sizes of 
the quiescent and star-forming galaxies would be 
R e = 0.6 kpc and R e = 2.8 kpc, respectively. 

3. We found five compact (R e < 1.4 kpc), star- 
forming (sSFR^ 10 -9 yr _1 ) galaxies at z ~ 3 
with large stellar masses (10 10 6 < log(M*/M ) < 
10 110 ) . The small effective radius was confirmed 



by a iSersid (JT968) profile fitting of the stacked im- 
age. 

4. The number density of massive z > 3 galaxies is 
comparable to the number density of z ~ 2 massive 
quiescent galaxies from the literature. The evolu- 
tion of the number density of the z > 3 galaxy 
population to z ~ 2 can be accounted for with a 
family of decaying or truncated SFHs. The CSF 
SFH docs not fit the observed number densities. A 
model with quenching of the SFR between z = 2.3 
and 3 does. 

5. When we evolve our z > 3 galaxies to z = 2.3, 
we find that the predicted number density of com- 
pact, massive, quiescent galaxies is consistent at 
the I — a level for all of our adopted SFHs. An even 
better agreement is obtained if we quench the star 
formation after the merging event. Such a rapid 
truncation is supporte d by recent simulations (e.g. 
[Johansson et al.ll2009| ). 

The above results lead to the following conclusions: 

• The population of massive quiescent galaxies at 
z ~ 2 can be completely accounted for by the ob- 
served z > 3 population of massive galaxies that is 
subsequently quenched. 

• The existence of compact, massive, quiescent 
galaxies at 3 < z < 4 pushes back in time the 
appearance of this class of objects to when the Uni- 
verse was ~ 2 Gyr old. 

• Since galaxy formation models predict that at 3 < 
z < 5 galaxies should still be building up from gas 
flows, the existence of compact, massive galaxies 
that are quiescent already at 3 < z < 4 implies 
that the quenching of star formation should be a 
rapid mechanism, acting on timescales of less than 
1 Gyr in the early Universe. 

We note that half of the sub-sample of massive and qui- 
escent galaxies is detected in MIPS, implying large SFRs 
if the MIR emission were associated to dust-enshrouded 
star formation. Since the MIPS band probes rest-frame 
wavelengths shorter than 6 /xm at z > 3 (i.e., emission 
from hot dust), SFRs derived from MIPS are very un- 
certain and can potentially be contaminated by emission 
from a dusty torus of an AGN. Observations in the far-IR 
(e.g., ALMA) are needed to robustly quantify the level 
of obscured star formation and to confirm the quiescent 
nature of these galaxies. 

The analysis presented in this work is based on a sam- 
ple size of about one hundred objects, resulting from the 
intersection of two catalogs, CANDELS and NMBS on 
the single CO SMOS field. Recent proj e cts like the 3D- 
HST survey (|van Dokkum et all 120111 : IBrammer et al.l 
I2012t ) will provide accurate redshift for ~ 7000 objects 
at 1 < z < 3.5 over a total area a factor of ~ 5 larger 
than the one available for this work. This will reduce 
by a large amount both the Poisson noise and the un- 
certainties due to the cosmic variance. Additionally, the 
increase in photometric depth from both CANDELS and 
the 3D-HST with respect to the currently available data 
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sets will also allow us to probe the population of galaxies 
down do smaller stellar masses. These improvements will 
contribute significantly in the near future to further un- 
derstanding the build-up and evolution of massive galax- 
ies. 
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